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Summary. This article describes a method to compute successive convex approxi- 
mations of the convex hull of a set of points in R n that are the solutions to a system 
of polynomial equations over the reals. The method relies on sums of squares of poly- 
i | normals and the dual theory of moment matrices. The main feature of the technique 

r) is that all computations are done modulo the ideal generated by the polynomials 

fs defining the set to the convexified. This work was motivated by questions raised by 

. Lovasz concerning extensions of the theta body of a graph to arbitrary real alge- 

_C braic varieties, and hence the relaxations described here are called theta bodies. The 

convexification process can be seen as an incarnation of Lasserre's hierarchy of con- 

avex relaxations of a semialgebraic set in R n . When the defining ideal is real radical 
the results become especially nice. We provide several examples of the method and 
discuss convergence issues. Finite convergence, especially after the first step of the 
t-H method, can be described explicitly for finite point sets. 



1 Introduction 



An important concern in optimization is the complete or partial knowledge 
f— ^ of the convex hull of the set of feasible solutions to an optimization problem. 

(^) Computing convex hulls is in general a difficult task, and a classical exam- 

ple is the construction of the integer hull of a polyhedron which drives many 
^. algorithms in integer programming. In this article we describe a method to 

convexify (at least approximately), an algebraic set using semidennite pro- 
r% gramming. 

C^ By an algebraic set we mean a subset S C W 1 described by a finite list 

of polynomial equations of the form f\ (x) = f% (x) = . . . = f t (x) = where 
/,(x) is an element of R[x] := R[xi, . . . , x n ], the polynomial ring in n variables 
over the reals. The input to our algorithm is the ideal generated by /i, . . . , f t , 
denoted as I = (/i,...,/t), which is the set {5Z i=1 <7i/i : 9i *= K[x]}. An 
ideal 7 C R[x] is a group under addition and is closed under multiplication by 
elements of R[x]. Given an ideal JCI[x], its real variety, Vr(T) — {x G R n : 
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/(x) = V / £ 1} is an example of an algebraic set. Given I, we describe 
a method to produce a nested sequence of convex relaxations of the closure 
of conv(Vfa(J)), the convex hull of Vr(I), called the theta bodies of I. The 
fc-th theta body THfc(J) is obtained as the projection of a spectrahedron (the 
feasible region of a semidefinite program), and 

THi(J) D TH 2 (7) D ■ ■ O TH fe (/) D TH fe+1 (7) D • • • D cl(conv(V R (J))). 

Of special interest to us are real radical ideals. We define the real radical of 
an ideal I, denoted as \/l, to be the set of all polynomials / <G R[x] such that 
j2m _|_ ^2 gj £ J f or some gi £ M[x] and mSN. We say that I is a real radical 
ideal if I = \/l. Given a set S C R™, the vanishing ideal of S, denoted as Z(S), 
is the set of all polynomials in R[x] that vanish on S. The Real Nullstellensatz 
(see Theorem 2) says that for any ideal /, X(Vr(I)) = \/l. 

The construction of theta bodies for arbitrary ideals was motivated by 
a problem posed by Lovasz. In [17] Lovasz constructed the theta body of a 
graph, a convex relaxation of the stable set polytope of a graph which was 
shown later to have a description in terms of semidefinite programming. An 
important result in this context is that the theta body of a graph coincides 
with the stable set polytope of the graph if and only if the graph is perfect. 
Lovasz observed that the theta body of a graph could be described in terms 
of sums of squares of real polynomials modulo the ideal of polynomials that 
vanish on the incidence vectors of stable sets. This observation naturally sug- 
gests the definition of a theta body for any ideal in R[x]. In fact, an easy 
extension of his observation leads to a hierarchy of theta bodies for all ideals 
as above. In [18, Problem 8.3], Lovasz asked to characterize all ideals that 
have the property that their first theta body coincides with cl(conv(Vfe(i r ))), 
which was the starting point of our work. For defining ideals of finite point 
sets we answer this question in Section 4. 

This article is organized as follows. In Section 2 we define theta bodies of 
an ideal in M[x] in terms of sums of squares polynomials. For a general ideal 
/, we get that THj.(7) contains the closure of the projection of a spectrahe- 
dron which is described via combinatorial moment matrices from /. When the 
ideal I is real radical, we show that THfc(J) coincides with the closure of the 
projected spectrahedron, and when I is the defining ideal of a set of points in 
{0, 1}", the closure is not needed. We establish a general relationship between 
the theta body sequence of an ideal I and that of its real radical ideal \fl. 

Section 3 gives two examples of the construction described in Section 2. 
As our first example, we look at the stable sets in a graph and describe the 
hierarchy of theta bodies that result. The first member of the hierarchy is 
Lovasz's theta body of a graph. This hierarchy converges to the stable set 
polytope in finitely many steps as is always the case when we start with a 
finite set of real points. The second example is a cardiod in the plane in which 
case the algebraic set that is being convexificd is infinite. 

In Section 4 we discuss convergence issues for the theta body sequence. 
When Vr(I) is compact, the theta body sequence is guaranteed to converge to 
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the closure of coilv(Vr(I)) asymptotically. We prove that when Vr(I) is finite, 
TH fe (J) = cl(conv(Vit(/))) for some finite k. In the case of finite convergence, 
it is useful to know the specific value of k for which THfc(I) = cl(conv(Vfe(i"))). 
This notion is called exactness and we characterize exactness for the first theta 
body when the set to be convexificd is finite. There are examples in which 
the theta body sequence does not converge to c1(coiiv(Vk(-0)). While a full 
understanding of when convergence occurs is still elusive, we describe one 
obstruction to finite convergence in terms of certain types of singularities of 
V R (J). 

The last section gives more examples of theta bodies and their exactness. 
In particular we consider cuts in a graph and polytopes coming from the graph 
isomorphism question. 

The core of this paper is based on results from [5] and [3] which are pre- 
sented here with a greater emphasis on geometry, avoiding some of the alge- 
braic language in the original results. Theorems 5, 7 and their corollaries are 
new while Theorem 9 is from [4] . The application of theta bodies to polytopes 
that arise in the graph isomorphism question is taken from [15]. 

Acknowledgments. Both authors were partially supported by the NSF 
Focused Research Group grant DMS-0757371. J. Gouveia was also partially 
supported by Fundagao para a Ciencia e Tecnologia and R.R. Thomas by the 
Robert R. and Elaine K. Phelps Endowed Professorship. 



2 Theta bodies of polynomial ideals 

To describe the convex hull of an algebraic set, we start with a simple obser- 
vation about any convex hull. Given a set S C R™, cl(conv(£)), the closure of 
conv(5), is the intersection of all closed half-spaces containing S: 

cl(conv(S')) ={pel": Z(p) > V / £ R[x]i s.t. l\ s > 0}. 

From a computational point of view, this observation is useless, as the right 
hand side is hopelessly cumbersome. However, if S is the zero set of an ideal 
I C K[x], we can define nice relaxations of the above intersection of infinitely 
many half-spaces using a classical strengthening of the nonnegativity condi- 
tion l\s > 0. We describe these relaxations in this section. In Section 2.1 we 
introduce our method for arbitrary ideals in M[x]. In Section 2.2 we specialize 
to real radical ideals, which occur frequently in applications, and show that 
in this case, much stronger results hold than for general ideals. 

2.1 General ideals 

Recall that given an ideal / C R[x], two polynomials / and g arc defined to 
be congruent modulo /, written as / = g mod /, if f — g € I. The relation = 
is an equivalence relation on R[x] and the equivalence class of a polynomial / 
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is denoted as / + 7. The set of all congruence classes of polynomials modulo 
7 is denoted as R[x]/7 and this set is both a ring and a K- vector space: 
given f,g€ K[x] and AeR, (/ + I) + (g + 7) = (/ + g) + 7, A(/ + /) = 
Xf + I, and (f+I)(g + I) = fg + I. Note that if/ = g mod 7, then /(s) = g(s) 
for all s e Vr(7). 

We will say that a polynomial ft G K[x] is a sum of squares (sos) modulo 
7 if there exist polynomials gi,...,g r € M[x] such that ft. = X)[=i 5? mod 
7. If ft, is sos modulo 7 then we immediately have that ft is nonnegative on 
Vr(7). In practice, it is important to control the degree of the gi in the sos 
representation of ft, so we will say that ft is k-sos mod 7 if gi, . . . , g r e R[x]fc, 
where M[x]fc is the set of polynomials in M[x] of degree at most fc. The set of 
polynomials that are fc-sos mod 7, considered as a subset of M.[x\2k/I will be 
denoted as E).(I). 

Definition 1. Let I C R[x] oe a polynomial ideal. We define the k-th thcta 
body of I to be the set 

TH fe (J) := {peR": l(p) > /or all I e E[x]i s.i. Z is fc-sos morf J}. 

Since, if / is sos mod / then I > on Vr(I), and THfc(J) is closed, 
cl(conv(V K (7))) C THfe(J). Also, TH fe (7) C TH fe _i(7), since as fc increases, 
we are potentially intersecting more half-spaces. Thus, the theta bodies of I 
create a nested sequence of closed convex relaxations of conv(VR(7)). 

We now present a related semidefinite programming relaxation of Vr(7) 
using the theory of moments. 

For I C R[x] an ideal, let B = {/o + I, f\ +/,...} be a basis for the 
R- vector space R[x]/7. We assume that the polynomials f representing the 
elements of B are minimal degree representatives of their equivalence classes 
fi + I. This makes the set Bk := {fi + 1 € B : deg(/j) < fc} well-defined. In 
this paper we will restrict ourselves to a special type of basis B. 

Definition 2. Let I C R[x] 6e an idea/. ^4 basis B = {/o + 7, /i + 7, . . .} o/ 
R[x]/7 is a 9-basis if it satisfies the following conditions: 

1- /o(x) = 1; 

^- /i( x ) = x i, f ori = l,...,n; 

5. /j is monomial for all i; 

4- if deg(f), deg(/,) < fc iften /j/j + I is in the real span of B2k- 

We will also always assume that B is ordered and that deg(/i) < deg(/i+i). 

Using Grobner bases theory, one can see that if Vr(7) is not contained 
in any proper affine space, then R[x]/7 always has a 0-basis. For instance, 
take the fi in B to be the standard monomials of an initial ideal of 7 with 
respect to some total degree monomial ordering on R[x] (see for example [2]). 
The methods we describe below work with non monomial bases of R[x]/7 as 
explained in [5] . We restrict to a #-basis in this survey for ease of exposition 
and since the main applications we will discuss only need this type of basis. 
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Fix a 8-basis B of M[x]/7 and define ffc[x] to be the column vector formed by 
all the elements of Bk in order. Then f^ [x] f j. [x] T is a square matrix indexed by 
Bk with (i, j)-entry equal to ftfj + 1. By hypothesis, the entries of ffc[x]ffc[x] T 
lie in the M-span of B 2 k- Let {A' } be the set of real numbers such that 
fifj + I = J2f,+ieB 2k A,jfl + I - We now linearize f fc [x]f fe [x] T by replacing 
each element of B 2 k by a new variable. 

Definition 3. Let I, B and { \\j } be as above. Let y be a real vector indexed 
by £>2fc- The /c-th combinatorial moment matrix Mg fc (y) of I is the real matrix 
indexed by B k whose (i,j)- entry is [M)3 k (y)]i,j = J2f l+ ieB 2k X i,jVi- 

Example 1. Let I C R[xi,X2] be the ideal I = (x\ — 2x 2 + x\,x\X2). Then 
a #-basis for L would be B — {l,xi, x 2 ,x 2 ,x 2 ,x 2 , . . .}. Let us construct the 
matrix Mg 1 (y). Consider the vector fi[x] = (lxiX2) T , then 

/ 1 Xi X 2 \ / 1 Xi x 2 \ 

fi[x]fi[x] T = xi x\ xix 2 I = I Xi 2x 2 — xi modi. 
\x 2 xix 2 x\ J \x\ x\j 

We now linearize the resulting matrix using y = (yo,yi,y 2 , ■■ .), where yi 
indexes the zth element of B, and get 

' 2/0 Vi 2/2 
M Bl (y)= I 2/1 22/2 -2/1 
, 2/2 2/3 1 

The matrix M Bk (y) will allow us to define a relaxation of cl(conv(ViR(/))) 
that will essentially be the theta body TH^ (L) . 

Definition 4. Let L C R[x] be an ideal and B a 8-basis o/R[x]/J. Then 

Q Bk (I) := 7r R „({y e R B2k : 2/0 = h M Bk (y) h 0}) 

where 7Tr™ is projection ofy£ M B2fc to (j/i, . . . , y n ), its coordinates indexed by 



The set Qs k {I) is a relaxation of conv(VR(/)). Pick s £ Vr(I) and define 
y s := (Ms) : f t +I £ B 2k ). Theny s (y s )* = M B Jy s ), y s = 1 and7r R „(y s ) = s. 
We now show the connection between Qs k (L) and TH& (I) . 

Theorem 1. For any ideal L C R[x] and any 8-basis B of R[x]/J, we get 
cl(Q Bfc (J))CTH fe (J). 

Proof: We start with a general observation concerning /c-sos polynomials. 
Suppose h = Y^i=i 5 l 2 modi where gi £ R[x] fe . Each ^ can be identified with a 
real row vector §i such that gi (x) = g^ffe [xjmodl, and so gf = f^ [x] 7 ^ §ii k [x] . 
Denoting by Ph the positive semidefinite matrix X)[=i ft ft we § e t M x ) — 
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ffc[x] T f , jiffc[x]mod/. In general, Ph is not unique. Let h be the real row vector 
such that /i(x) = /if2fe[x] mod /. Then check that for any column vector 
y G R B2k , hy = Pf t ■ Mg fc (y), where • stands for the usual entry- wise inner 
product of matrices. 

Suppose p G Qi3 k (I), and y G MP 2k such that y = 1, M Bk (y) > 
and 7rRn(y) = p. Since THfc(J) is closed, we just have to show that for any 
h = ao + X^r=i a i x i *= ^[ x ]i that is k-sos modulo /, h(p) > 0. Since yo = 1 and 

7i"R"(y) = P, h(p) = a + Y]i=i a iPi = a oVo + J2"=i a iVi = hy = P h • M$ k (y) > 
since P ft > and M Bfc (y) y 0. D 

In the next subsection we will see that when / is a real radical ideal, 
cl(Q Bk (I)) coincides with TH fe (I). 

The idea of computing approximations of the convex hull of a semialge- 
braic set in M. n via the theory of moments and the dual theory of sums of 
squares polynomials is due to Lasserre [7, 8, 9] and Parrilo [20, 21]. In his 
original set up, the moment relaxations obtained are described via moment 
matrices that rely explicitly on the polynomials defining the semialgebraic 
set. In [8], the focus is on semialgebraic subsets of {0, 1}" where the equations 
xf — Xi are used to simplify computations. This idea was generalized in [12] 
to arbitrary real algebraic varieties and studied in detail for zero-dimensional 
ideals. Laurent showed that the moment matrices needed in the approxima- 
tions of coiw(Vr(I)) could be computed modulo the ideal defining the variety. 
This greatly reduces the size of the matrices needed, and removes the depen- 
dence of the computation on the specific presentation of the ideal in terms of 
generators. The construction of the set <5e fc (/) is taken from [12]. Since an 
algebraic set is also a semialgebraic set (defined by equations), we could apply 
Lasserre's method to Vr(I) to get a sequence of approximations coiiv(Vr(/)). 
The results arc essentially the same as theta bodies if the generators of / arc 
picked carefully. However, by restricting ourselves to real varieties, instead 
of allowing inequalities, and concentrating on the sum of squares description 
of theta bodies, as opposed to the moment matrix approach, we can prove 
some interesting theoretical results that are not covered by the general theory 
for Lasserre relaxations. Many of the usual results for Lasserre relaxations 
rely on the existence of a non-empty interior for the semialgebraic set to be 
convexified which is never the case for a real variety, or compactness of the 
semialgebraic set which we do not want to impose. 

2.2 Real radical ideals 

Recall from the introduction that given an ideal I C M[x] its real radical is 
the ideal 

\Tl = {/ G R[x] : f 2m + J2 fh e I, m G N, g t G R[x]} . 
The importance of this ideal arises from the Real Nullstcllensatz. 



Convex Hulls of Algebraic Sets 7 

Theorem 2 (Bounded degree Real Nullstellensatz [16]). Let I C R[x] 
be an ideal. Then there exists a function T : N — > N such that, for all polyno- 
mials f e R[x] of degree at most d that vanish on Vtr(I), f 2m + J29i G I f or 
some polynomials gi such that deg(gi) and m are all bounded above by T(d). 
In particular I (Vr(7)) = y7. 

When 7 is a real radical ideal, the sums of squares approach and the 
moment approach for thcta bodies of I coincide, and we get a stronger version 
of Theorem 1. 

Theorem 3. For any I C R[x] real radical and any 6-basis B of R[x]/7, 
cl(Q Bfe (7)) = TH fe (7). 

Proof: By Theorem 1 we just have to show that THfc(J) C c\(Q& k (I)). By 
[22, Prop 2.6], the set Sk(I) of elements of R[x\2k/I that are fc-sos modulo 7, is 
closed when 7 is real radical. Let / € R[x]i be nonnegative on cl(Qg fe (J)) and 
suppose f + 1 Sk{I)- By the separation theorem, we can find y e Mr h such 
that /y < but gy > for all g + I e 27* (7), or cquivalently, P ff • M Bk (y) > 0. 
Since P g runs over all possible positive semidcfinitc matrices of size \B k \, and 
the cone of positive semidcfinitc matrices of a fixed size is self-dual, we have 
M& k (y) >z 0. Let r be any real number and consider g r + I := (/ + r) 2 + I E 
Sk(I)- Then g r y = f 2 y + 2r/y + r 2 yo > 0. Since fy < and r can be 
arbitrarily large, j/o is forced to be positive. So we can scale y to have yo = 1, 
so that 7TRn(y) € Qe k {I)- By hypothesis we then have /(7TR™(y)) > 0, but by 
the linearity of /, /(7TRr,(y)) = fy < which is a contradiction, so / must be 
fc-sos modulo /. This implies that any linear inequality valid for cl(Qe,fc(7)) 
is valid for TH fe (7), which proves TH fe (7) C cl(Qs,fe(7)). □ 

We now have two ways of looking at the relaxations TH^, (I) — one by a 
characterization of the linear inequalities that hold on them and the other by 
a characterization of the points in them. These two perspectives complement 
each other. The inequality version is useful to prove (or disprove) conver- 
gence of theta bodies to cl(conv(Vfe(7))) while the description via semidcfinitc 
programming is essential for practical computations. All the applications we 
consider use real radical ideals in which case the two descriptions of TH^, (I) 
coincide up to closure. In some cases, as we now show, the closure can be 
omitted. 

Theorem 4. Let / C l[x] be a real radical ideal and k be a positive integer. If 
there exists some linear polynomial g that is k-sos modulo I with a representing 
matrix P g that is positive definite, then Qs k {I) is closed and equals THfc(7). 

Proof: For this proof we will use a standard result from convex analysis: Let V 
and W be finite dimensional vector spaces, H C W be a cone and A : V -^ W 
be a linear map such that A(V)n'mt(H) ^ 0. Then (A _1 77)* = A 1 (77*) where 
A' is the adjoint operator to A. In particular, A' (77*) is closed in V , the 
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dual vector space to V. The proof of this result follows, for example, from 
Corollary 3.3.13 in [1] by setting K = V. 

Throughout the proof we will identify R[x]i/I, for all I, with the space Mr 1 
by simply considering the coordinates in the basis Bi. Consider the inclusion 
map A : R Bl = R n+1 -)• R B2k , and let H be the cone in R B2k of polynomials 
that can be written as a sum of squares of polynomials of degree at most 
k. The interior of this cone is precisely the set of sums of squares g with a 
positive definite representing matrix P g . Our hypothesis then simply states 
that A(R Bl ) n int(H) ^ which implies by the above result that A'(H*) 
is closed. Note that H* is the set of elements y in R B2k such that hy is 
nonnegative for all h G H and this is the same as demanding P/, ■ Mj$ h (y) > 
for all positive semidefinite matrices Ph, which is equivalent to demanding 
that M Bk (y) y 0. So A'(H*) is just the set 7r R n+i({y G R B2k : M B Jy) y 0}) 
and by intersecting it with the plane {y G R 2k : yo = 1} we get Q Bk (I) which 
is therefore closed. □ 

One very important case where the conditions of Theorem 4 holds is when 
/ is the vanishing ideal of a set of 0/1 points. This is precisely the case of 
most interest in combinatorial optimization. 

Corollary 1. If S C {0, 1}™ and I = 1(S), then Q Bk (I) = TH fe (7). 

Proof: It is enough to show that there is a linear polynomial g G R[x] such 
that g = ffe[x] T ^4ffc[x] mod / for a positive definite matrix A and some 0-basis 

Boft[x]//. 

Let A be the matrix 

'/ + lc r 



\ c D 

where I + 1 = \Bk\, c G R l is the vector with all entries equal to —2, and 
D <G R lxl is the diagonal matrix with all diagonal entries equal to 4. This 
matrix is positive definite since D is positive definite and its Schur complement 
(I + 1) — c t D~ 1 c = 1 is positive. 

Since xf = %{ mod / for i = l,...,n and B is a monomial basis, for 
any / + I G B, f = / mod /. Therefore, the constant (linear polynomial) 
I + 1 = f fe [x] T Af fe [x] mod I. D 

The assumption that / is real radical seems very strong. However, we now 
establish a relationship between the theta bodies of an ideal and those of its 
real radical, showing that \/l determines the asymptotic behavior of THfe(/). 
We start by proving a small technical lemma. 

Lemma 1. Given an ideal I and a polynomial f of degree d such that — f 2m G 
£k(I) for some m, k G N, the polynomial f + s G £k+4dm{I) for every e > 0. 



f i +s=U^+sX+U-f m )f i - 2m 
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Proof: First, note that for any I > m and any £ > we have 

2 

and so f l + £ is (dZ + fc)-sos modulo i". For er > 0, define the polynomial p(x) 
to be the truncation of the Taylor series of y/a 2 + Aax at degree 2m — 1 i.e., 

2m_1 r9«v 

p(a;) - V (-1)" , — [ >\ -x n . 

When we square p(x) we get a polynomial whose terms of degree at most 
2m — 1 are exactly the first 2m — 1 terms of a 2 + 4ax, and by checking the 
signs of each of the coefficients of p(x) we can see that the remaining terms of 
p(x) 2 will be negative for odd powers and positive for even powers. Composing 
p with / we get 

m— 1 m— 2 

(p(/(x))) 2 = a 2 +4a/(x) + £ ai /(x) 2m + 2 * - £ foJ(x) 2m + 2 * +1 

j=0 j=0 

where the di and 6, are positive numbers. In particular 



a 2 +4a/(x) =p(/(x)) 2 + ]T aj(x) 2 *(-/(x) 2m ) + ]T 6,/(x) 



\2m+2i+l 

4 = 4=0 



On the right hand side of this equality the only term who is not immediately a 
sum of squares is the last one, but by the above remark, since 2m + 2i + 1 > m, 
by adding an arbitrarily small positive number, it becomes (d(2m+2i+l)+k)- 
sos modulo /. By checking the degrees throughout the sum, one can see that 
for any £ > 0, a 2 + 4<r/(x) + £ is (4dm + fc)-sos modulo I. Since a and £ are 
arbitrary positive numbers we get the desired result. □ 

Lemma 1, together with the Real Nullstellensatz, gives us an important 
relationship between the theta body hierarchy of an ideal and that of its real 
radical. 

Theorem 5. Fix an ideal I . Then, there exists a function $ : N — > N such 
that TEWm C TH / t(v / T) for all k. 

Proof: Fix some fc, and let /(x) be a linear polynomial that is k-sos modulo 
\fl . This means that there exists some sum of squares s(x) € R[x]2fc such that 
f-s€ \fl. Therefore, by the Real Nullstellensatz (Theorem 2), -(f-s) 2m e 
Si(I) for l,m < T(2k), where r : N — >• N depends only on the ideal /. By 
Lemma 1 it follows that / — s + e is r(2fc) + 8fcr(2fc)-sos modulo / for every 
e > 0. Let *(fc) := T(2fc) + 8fcr(2fc). Then we have that / + e is *(fc)-sos 
modulo / for all e > 0. This means that for every e > 0, the inequality 
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/ + e > is valid on TH^^) (-0 for / linear and fc-sos modulo \fl. Therefore, 
/ > is also valid on TH^ (fe) (7), and hence, TH^ (fe) (7) C TH fe (^7). D 

Note that the function "J whose existence we just proved, can be noto- 
riously bad in practice, as it can be much higher than necessary. The best 
theoretical bounds on ty come from quantifier elimination and so increase 
very fast. However, if we are only interested in convergence of the theta body 
sequence, as is often the case, Theorem 5 tells us that we might as well assume 
that our ideals are real radical. 



3 Computing theta bodies 

In this section we illustrate the computation of theta bodies on two examples. 
In the first example, Vr(I) is finite and hence conv(VR(/)) is a polytope, while 
in the second example Vr(7) is infinite. Convex approximations of polytopes 
via linear or semidefinite programming have received a great deal of atten- 
tion in combinatorial optimization where the typical problem is to maximize 
a linear function c • x as x varies over the characteristic vectors \ T € {0, l} n 
of some combinatorial objects T. Since this discrete optimization problem 
is equivalent to the linear program in which one maximizes c • x over all 
x G conv({x T }) and conv({x T }) is usually unavailable, one resorts to approx- 
imations of this polytope over which one can optimize in a reasonable way. 
A combinatorial optimization problem that has been tested heavily in this 
context is the maximum stable set problem in a graph which we use as our 
first example. In [17], Lovasz constructed the theta body of a graph which was 
the first example of a semidefinite programming relaxation of a combinatorial 
optimization problem. The hierarchy of theta bodies for an arbitrary poly- 
nomial ideal are a natural generalization of Lovasz's theta body for a graph, 
which explains their name. Our work on theta bodies was initiated by two 
problems that were posed by Lovasz in [18, Problems 8.3 and 8.4] suggesting 
this generalization and its properties. 

3.1 The maximum stable set problem 

Let G = ([n], E) be an undirected graph with vertex set [n] — {1, . . . , n} and 
edge set E. A stable set in G is a set U C [n] such that for all i,j G U, 
{i,j} 4- E- The maximum stable set problem seeks the stable set of largest 
cardinality in G, the size of which is the stability number of G, denoted as 
a(G). 

The maximum stable set problem can be modeled as follows. For each 
stable set U C. [n], let x U € {0,1}™ be its characteristic vector defined as 
{X u )i = 1 if i G U and ( X u )t = otherwise. Let S G C {0, 1}™ be the set of 
characteristic vectors of all stable sets in G. Then STAB(G) := conv(S'G) is 
called the stable set polytope of G and the maximum stable set problem is, 
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in theory, the linear program max{^ i=1 Xi : x G STAB(G)} with optimal 
value a(G). However, STAB(G) is not known apriori, and so one resorts to 
relaxations of it over which one can optimize J^._ 1 Xi. 

In order to compute theta bodies for this example, we first need to view 
S G as the real variety of an ideal. The natural ideal to take is, X(S G ), the 
vanishing ideal of Sq- It can be checked that this real radical ideal is 

I G := {xi -XjV i e [n], XiXj\f {i,j} g E) C R[x 1 ,. ..,#„]. 

For U C [n], let x* 7 := nie;y Xi - From the generators of I G it follows that if 
/ € R[x] , then f = g mod / G where 5 is in the M-span of the set of monomials 
{x u : U is a stable set in G}. Check that 

B := {x u + I G : U stable set in G} 

is a 0-basis of M[x]// G containing 1+/ G ,a;i+/ G , ■ ■ ■ , x n +I G . This implies that 
B k = {x. u + I G ■ U stable set in G, \U\< k}, and for x u > +I G , x^ +I G <= B fc , 
their product is x 17 * 1 - 1 ^ + I G which is + I G if Ui U E/j is not a stable set in 
G. This product formula allows us to compute Mg fc (y) where we index the 
element x u + I G <= B k by the set U. Since S G C {0, 1}™ and I G = 1{S G ) is 
real radical, by Corollary 1, we have that 



TH fc (J G ) = ^ 



ye 



3M y 0, M e Rl^lxl^l such that 
M 00 - 1, 

Ms{i} = M {i}® = M {i}{i} = J/i 

M[/[// = if [7 U [/' is not stable in G 
Mt/(7< = M w w , if [7 U [/' = W U W 



THi(Jg) = ye 



In particular, indexing the one element stable sets by the vertices of G, 

3 M y 0, M e R(«+ 1 ) x (™+ 1 ) such that ' 
. M 00 = l, 
' Af 0l = M i0 = Af« = y, V i e [n] 

M y =0V{«,j}e£ 

Example 2. Let G = ({1, 2, 3, 4, 5}, -E) be a pentagon. Then 

I G = {x\ - Xi Vi = 1,... ,5, £10:2, Z2a;3, £30:4, a; 4 a;5, £125) 

and 

B = {l,.Ti,x 2 ,a;3,a;4,X5,a;iX3,a;ia;4,X2a;4,a;2X5,a;3a;5} + / G . 

Let y € M 11 be a vector whose coordinates arc indexed by the elements of B 
in the given order. Then 



TH!(/ G ) 



ye 



3 y 6 ,...,y w s.t. 



/ 1 2/i 2/2 2/3 2/4 2/5 \ 

2/1 2/1 2/6 2/7 

2/2 j/2 2/8 2/9 

2/3 ft ft j/io 

2/4 2/7 2/8 2/4 

V 2/5 2/9 2/10 2/5 / 



>-0 
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and TH 2 (J G ) 



( 1 

Vi 

2/2 



ye 



3 y 6 , •••, J/io s.t. 



Vi 2/2 

Vx 

2/2 

2/3 2/6 

2/4 2/7 2/8 

2/5 2/9 

2/6 2/6 

2/7 2/7 

2/8 

2/9 



2/3 2/4 2/5 

2/6 2/7 

2/8 2/9 

2/3 2/io 

2/4 



2/6 2/7 2/8 2/9 2/io \ 
2/6 2/ 7 
2/ 8 2/9 



2/6 
2/7 2/8 



2/io 




2/8 
2/9 



2/io 2/5 2/9 2/io 

2/6 

2/7 

2/8 

OO2/9 



2/6 

















;<77 

















2/8 

















y<) 






h0> 



\yio yio 2/io yio / 

It will follow from Proposition 1 below that 

STAB(G) = TH 2 (J G ) C TH^/c). 

In general, it is a non-trivial task to decide whether two convex bodies coincide 
and thus to check whether a given theta body, THfe(7), equals conv(V]R(/)). 
One technique is to show that all linear functions Z(x) such that Z(x) > on 
Vr(I) are fc-sos mod /. Two illustrations of this method appear shortly. 

The first theta body of Iq is exactly the theta body, TH(G), of G as 
defined by Lovasz [19, Lemma 2.17]. The higher theta bodies THfc(/ G ) shown 
above give a nested sequence of convex relaxations of STAB(G) extending 
Lovasz's TH(G). The problem 



c{][> 2 : xeTH(G)} 



can be solved to arbitrary precision in polynomial time in the size of G via 
semidefinite programming. The optimal value of this semidefinite program 
is called the theta number of G and provides an upper bound on a(G). See 
[6, Chap. 9] and [26] for more on the stable set problem and TH(G). The 
body TH(G) was the first example of a semidefinite programming relaxation 
of a discrete optimization problem and snowballed the use of semidefinite 
programming in combinatorial optimization. Sec [14, 18] for surveys. 

Recall that a graph G is perfect if and only if G has no induced odd 
cycles of length at least five, or their complements. Lovasz showed that 
STAB(G) = TH(G) if and only if G is perfect. This equality shows that 
the maximum stable set problem can be solved in polynomial time in the size 
of G when G is a perfect graph, and this geometric proof is the only one known 
for this complexity result. Since a pentagon is not perfect, it follows that for 
G a pentagon, STAB(G) C THi(I G ). We will see in Corollary 3 that when 
Vr(I) is finite, then there is some finite k for which TH^(7) = conv(VR(/)). 
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Since no monomial in the basis B of R[x]/7 G nas degree larger than a(G), 
for any G, STAB(G) = TH Q ( G )(7 G ) which proves that when G is a pen- 
tagon, STAB(G) = TH 2 (7 G ). We now prove a second, general reason why 
STAB(G) = TH 2 (7 G ) when G is a pentagon. 

A simple class of linear inequalities that are valid on STAB(G) are the odd 
cycle inequalties, ^2 ieC %i < — 2 — , where G is an odd cycle in G. 

Proposition 1. For any graph G, all odd cycle inequalities are valid on 
TH 2 (/ G ). 

Proof: Let n — 2fc + 1 and G be an n-cycle. Then 7 G = (xf — Xi, XiXi + \ V i G 
[n]) where x n+ i — X\. Therefore, (1 — a^) 2 = 1 — Xi and (1 — Xi — a^i+i) 2 = 
1 — Xi — x i+ i mod 7 G . This implies that, mod J G , 

pf := {(l-X 1 )(l-X 2 i-X 2 i+l)) 2 =Pi = l-Xi-X2i-X2i+l+XiX2i+XiX 2 i+l. 

Summing over i = 1, .., fc, we get 

k 2fc+l 2fe 

y pf = fc — kx\ — y^ x i + 2_j x i x i m °d 7 G 

j=l j=2 i=3 

since xix 2 and x\Xik+\ lie in 7 G . Define <7i := £i(l — x 2 i+\ — x 2 i +2 )- Then 
9i — 9i ^ Ic and mod 7 G we get that 

fe-l 2fc fc fc-1 2k+l 

^2 g i — ( fc - 1 ) a;i _ X! XlXl ' wnicn implies J^p 2 + J^ .g 2 = fc - J^ x % . 

i— 1 2—3 i— 1 i— 1 i— 1 

Thus, fc — X^i=i s-i i s 2-sos mod J G . 

Now let G be any graph and G be an induced (2fc + l)-cycle in G. Then 
since 7 G C 7 G , fc — ^i=i #« i s 2-sos mod 7 G which proves the result. D 

Corollary 2. 7/ G is an odd cycle of length at least five, STAB(G) = 
TH 2 (7 G ). 

Proof: It is known that the facet inequalities of STAB(G) for G an n = 2fc+l- 
cycle are: 

n 

Xi > V % G [n] , 1 — x t — x i+ i > V i G [n] , and fc — N^ x$ > 

i=i 

(see for instance, [25, Corollary 65.12a]). Clearly, x t is 2-sos mod 7 G for all 
i G [n], and check that 1 — Xi — Xj = (1 — Xi — Xj) 2 mod 7 G for all {i,j} G E. 
By Proposition 1, fc — Y^i=i x i I s 2-sos mod 7 G . If Z(x) G R[x]i such that 
Z(x) > on STAB(G), then Z(x) is a nonnegative linear combination of the 
facet inequalities of STAB(G), and hence Z(x) is 2-sos mod 7 G . D 
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Using the same method as in Proposition 1, one can show that the odd 
antihole and odd wheel inequalities [6, Chap. 9] that are valid for STAB(G) arc 
also valid for TH2(/g)- Schoenebeck [24] has shown that there is no constant k 
such that STAB(G) = TH fc (7 G ) for all graphs G (as expected, unless P=NP). 
However, no explicit family of graphs that exhibit this behavior is known. 



3.2 An infinite variety 

Consider the cardioid with defining equation 

(x\ + x\ + 2xi f = 4(x? + x%). 
This plane curve is the real variety of the ideal I = (h) where 



h 



ZX -i x 



2„2 

2 



Ax\ + Ax x x 



Axi 



Points on this variety are parametrized by the angle 9 made by the line seg- 
ment from the origin to the point, with the ^i-axis, by the equations: 

xi{6) = 2cos9(l-cos9), x 2 {9) = 2sin 6 (1 - cos 6) . 

It can be checked that the half space containing the origin defined by the 
tangent to the cardiod at (xi(9),x 2 (9)) is 



Xl 



sin 29 + sin 9 
2cos 2 6 + cos 6 - 1 



x 2 



sin29 — 2sint 
2cos 2 9 + cos 9 - 



1 



>0. 



We now compute theta bodies of this ideal. Since the defining ideal of the 
cardiod is generated by a quartic polynomial, no linear polynomial is a sum 
of squares of linear polynomials modulo this ideal. Therefore, THi(7) = R 2 . 
A R-vector space basis B of W l x\,X2\/I is given by the monomials in x\ and 
X2 that are not divisible by x\. In particular, 

jOzJ: — i -^ f t *-' 1 7 **-' 2 ; 1 ; •J-' X *-^ 2 ; *^ 2 i 1 i 1 2 1 "^ 1 2 i 2 5 1 2 i "^ 1 *^ 2 ? 1 2 i 2 f I -* j 

and suppose the coordinates of y <G ]R 4 are indexed as follows (j/q = 1) : 



1 


Xl 


x 2 


x\ 


X\X 2 


■ l: 2 


x\ 


x\x 2 


2 


• r 2 


a;fx 2 


2 2 
a 'l a; 2 


3 


2-2 


1 


2/1 


2/2 


2/3 


2/4 


275 


2/6 


2/7 


2/8 


279 


2/io 


2/ii 


2/12 


2/13 



To compute M# 2 (y) we need to express /j/j + 7 as a linear combination 
of elements in B4 for every /, + I, fj + I <G B 2 and then linearize this linear 
combination using y. For example, x\ +1 = —2x\x\ — x\—Ax\—Ax\x\+Ax\+I 
which linearizes to 



T := -2j/ii - 3/13 - 4y 6 - 4y 8 + 4y 5 . 
Doing all these computations shows that 
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TK 2 (I) = {(y u y 2 ) : 3y e 



pl3 



S.t. 



/ 1 2/1 2/2 2/3 2/4 2/5 \ 

2/1 2/3 2/4 2/6 2/7 2/8 

2/2 2/4 2/5 2/7 2/8 2/9 

2/3 2/6 2/7 T 2/io 2/ii 

2/4 2/7 2/8 2/io 2/ii 2/12 

\j/5 2/8 2/9 2/11 2/12 2/13/ 



>-0 



For a given 9 we can find the maximum t£K such that (t cos(9) , t sin(0)) is in 
TH2(I), using an SDP-solvcr. This point will be on the boundary of Tfi2(/), 
and if we vary 9, we will trace that boundary. In Fig. 1 we show the result 
obtained by repeating this procedure over 720 equally spaced directions and 
solving this problem using SeDuMi 1.1R3 [28]. 




Fig. 1. TH2(i) compared with the cardioid. 



From the figure, TH2(I) seems to match the convex hull of the cardiod, 
suggesting that Tfi2(/) = conv(Vg.(I)) . To prove this equality, we would have 
to construct a sos representation modulo the ideal for each of the tangents 
to the cardiod. Independently of that, it is clear from the figure that TH 2 (I) 
does a very good job of approximating this convex hull. 

We now explain how one can also optimize a linear function over a theta 
body using the original sos definition of the body. For a vector c € K™, 
maximizing c • x over THfc(J) is the same as minimizing A G R such that 
A — c • x is nonnegative on TH^(/). Under some mild assumptions such as 
compactness of THfc(I) or I being real radical, this is precisely the same as 
minimizing AgR such that A — c • x is k-sos modulo I. 

Consider the previous example of the cardioid and let c = (1, 1). If we want 
to optimize in that direction over TH2(/), we need to minimize A such that 
£\(xi, x 2 ) := A — x\ — x 2 is 2-sos modulo /. If f2(x) := (1 X\ x 2 x\ X\x 2 x^) 7 ', 
then £\(xi,x 2 ) is 2-sos modulo / if and only if there exists a 6 by 6 positive 
scmidefinitc matrix A such that 



4(x!,a;2) = f2(x) T Af2(x) modi. 
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By carrying out the multiplications and doing the simplifications using the 
ideal one gets that this is equivalent to finding A >z such that 

h(xi,x 2 ) = A n + 2A 12 x x + 2A 13 x 2 + (2A 14 + A 22 )x\ 

+ (2A 15 + 2A 23 )x x x 2 + (2A 16 + A 33 + AA 44 )x 2 2 
+ {2A 24 - 4^44)2;? + (2A 25 + 2A 34 )x\x 2 



-(2A 26 + 2A35 - AA 44 )x x x\ + 2A 36 x% + 2A 4b x\x 2 



i26 

-(2^46 



44 )X 1 X 2 + ZA 3 qX 2 

+ A 55 - 2A 44 )x\x\ + 2A^x x x\ + (A 66 - A 44 )x\ 



so our problem is minimizing A such that there exists ,4^0 verifying 



A n = A; 

Ai 2 = A 13 = 1/2; 

2A 14 + A 22 = 2A 15 + 2A 23 = 2A 16 + A 33 + AA 44 = 2A 24 
= 2A 25 + 2A 34 = 2A 26 + 2A35 - 4A 44 = A 36 = A 45 
= 2A 46 + A 55 - 2A 44 = A 56 = A 66 - A 44 = 



AA 4 







-— -T~~~~~^ 




{ < 








— -__\- 



Fig. 2. Outer contour of TH 2 (J). 



By taking c = (cos(#),sin(#)), varying 9 and tracing all the optimal £\ 
obtained one can get a contour of TH 2 (I). In Fig. 2 we can sec all the l\'s 
obtained by repeating this process 32 times using SeDuMi. 



4 Convergence and Exactness 



In any hierarchy of relaxations, one crucial question is that of convergence: 
under what conditions docs the sequence THk(I) approach cl(conv(VR(/)))? 
Another important question is whether the limit is actually reached in a finite 
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number of steps. In this section we will give a brief overview of the known 
answers to these questions. 

In general, convergence is not assured. In fact, for the ideal I = (x 3 — y 2 \, 
whose real variety is a cusp, the closure of conv(Vfe(i")) is the half plane de- 
fined by x > 0. However, all the theta bodies of I are just M 2 . This follows 
immediately from the fact that no linear polynomial can ever be written as a 
sum of squares modulo this ideal. Fortunately, many interesting ideals behave 
nicely with respect to the theta body hierarchy. In particular, the central re- 
sult in this area tells us that compactness of Vr(7) implies convergence of the 
theta body hierarchy of I. 

Theorem 6. For any ideal I C R[x] such that Vr(I) is compact, 

oo 

f|TH fe (/)=cl(conv(V R (/))). 

k=0 

Proof: Let /i, . . . , fk be a set of generators for /. We can think of Vr(I) as 
the compact semialgebraic set S = {x e K" : ±/i( x ) > 0}. Then Schmiidgen's 
Positivstellensatz [23] applied to S, guarantees that any linear polynomial ^(x) 
that is strictly positive on Vr(I) has a sos representation modulo /. Hence, 
Z(x) > is valid for TH fe (7) for some fc. □ 

In general, however, we are interested in finite convergence more than 
in asymptotic convergence, since in that case the theta bodies can give us a 
representation of the closure of the convex hull of the variety as the projection 
of a spectrahedron, an important theoretical problem on its own. We will say 
that an ideal I is TH fc -exact if TH fe (I) = cl(conv(V R (7))) and TH-exact if 
it is THfc-exact for some fc. We first note that to study TH-exactness of an 
ideal depends only on the real radical of the ideal. 

Theorem 7. An ideal I C R[x] is TH-exact if and only if yl is TH-exact. 

Proof: The "if" direction follows from Theorem 5, while the "only if" direc- 
tion follows from the fact that I C \fl. □ 

An important case in which TH-exactness holds is when Vr(/) is finite. 
Corollary 3. /// is an ideal such that Vr(I) is finite, then I is TH-exact. 

Proof: By Theorem 7 we can assume that I is real radical. If Vr(I) = 
{pi,...,p m }, then we can construct interpolators g\,...,g m <= ^[ x ] such 
that gi(pj) = 1 if j = i and otherwise. Assume that k is the highest degree 
of a gi. Since <?j — g 2 vanishes on Vr(I), gi~ g 2 £ \/I = I and therefore, gi is 
fc-sos mod / for i = 1, . . . , m. 

If /(x) > on V K (7), then check that /(x) = ]T™ 1 f(p l )g i (x) mod I. 
Since the polynomial on the right hand side is a nonncgative combination 
of the gi's which are fc-sos mod /, / is fc-sos mod I. In particular, all the 
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linear polynomials that are nonnegative on Vr(I) are k-sos mod I and so, 
TH fc (J)=conv(V R (J)). "' D 

Under the stronger assumption that Vc(I) is finite, TH-exactness of I 
follows from a result of Parrilo (see [13] for a proof). The work in [10] implies 
that when Vr(I) is finite, the bodies Qi3 k {I) converges in a finite number of 
steps, a slightly weaker result than the one in Corollary 3. Finite varieties are 
of great importance in practice. For instance, they are precisely the feasible 
regions of the 0/1 integer programs. 

To study TH-exactness in more detail, a better characterization is needed. 

Theorem 8. A real radical ideal I C R[x] is TH^-exact if and only if all 
linear polynomials I that are non-negative in Vr(I) are k-sos modulo I . 

Proof: The "if" direction follows from the definition of theta bodies. The 
"only if" direction is a consequence of the proof of Theorem 3 which said that 
when I is real radical, TH t (7) = c\(Qg t (I)) for all t. Recall that in the second 
part of that proof we showed that any linear inequality valid for cl(<3g t (/)) was 
t-sos mod I. Therefore, if THfc(7) = c1(coiiv(Vr(/))), then being nonnegative 
over cl(QB fc (I)) = THfc(J) is equivalent to being nonnegative over Vr(I) giving 
us the intended result. □ 

The last general result on exactness that we will present, taken from [4], 
is a negative one. Given a point p £ Vr(7) we define the tangent space of p, 
T P (I), to be the affine space passing through p and orthogonal to the vector 
space spanned by the gradients of all polynomials that vanish on Vr(I). We 
say that a point p in Vr(I) is convex-singular, if it is on the boundary of 
coiiv(Vk(-0) and T p (I) is not tangent to conv(VR(I)) i.e., T p (I) intersects the 
relative interior of conv(Vfa(/)). 

Theorem 9. An ideal with a convex-singularity is not TH-exact. 

Proof: Let I be an ideal and p a convex-singular point of Vr(I), and 
J = I(Vr(I)). By Theorem 7, it is enough to show that J is not TH-exact. 
Let Z(x) be a linear polynomial that is positive on the relative interior of 
coiiv(Vk(-0) an d zero at P, which we can always find since p is on the bound- 
ary of couv(Vr(I)). If J was TH^-exact for some k, then since J is real radical, 
by Theorem 8 we would be able to write l(x) = ct(x) + <?(x) where a is a sum 
of squares and g E J. Evaluating at p, we see that <r(p) = 0, which since 
a is a sum of squares, implies that Vcr(p) = 0. Therefore, we must have 
VZ = V.g(p). Let q be a point in the relative interior of conv(V]s(-0) that is 
also in T p (I). Then by the definition of T p (I), 

*(q) = (q - P)VZ = (q - p)V.g(p) = 
which contradicts our choice of Z. Hence, I is not TH^-exact. □ 
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Even in the presence of convex-singular points, the theta bodies of the 
defining ideal can approximate the convex hull of the real variety arbitrarily 
well, but they will not converge in finitely many steps. An example of this 
is the ideal / = (x 4 — x 3 + y 2 ). It has a compact real variety with a convex- 
singularity at the origin. Hence we have asymptotic convergence of the theta 
bodies, but not finite convergence. The first theta body of this ideal is M 2 and 
in Fig. 3 we can sec that the next two theta bodies already closely approximate 
the convex hull of the variety 




Fig. 3. The second and third theta bodies of I = (x 4 — x 3 + j/ 2 ) 



In the remainder of this section we focus on a more restricted exactness 
question. Problem 8.3 in [18] motivates the question of which ideals in R[x] arc 
TIT-exact. Of particular interest are vanishing ideals of finite sets of points 
in R" and ideals arising in combinatorial optimization. 

Theorem 10. Let S be a finite set of points in R n , then I = I(S) is TH4- 
exact if and only if for each facet F of the polytope conv(S') there exists a 
hyperplane H parallel to F such that S C F U H . 

Proof: We can assume without loss of generality that conv(5) is full- 
dimensional, otherwise we could restrict ourselves to its affine hull. Assume 
I is THi-exact and let F be a facet of conv(S'). We can find a linear poly- 
nomial I such that l(p) — for all p <G F and l(p) > for all p e S. By 
Theorem 8, I must be 1-sos modulo / which implies / = X)[=i fffmod/ for 
some linear polynomials gi. In particular, l(p) = J2i=i(9i(p)) 2 f° r au P S 6", 
and since I vanishes in all points of S fl F, so must the g^. This implies that 
all the <7j's vanish on the hyperplane {x e R n : l(x) = 0}, since they are 
linear. This is equivalent to saying that every gi is a scalar multiple of I, 
and therefore, I = Xl 2 modI for some nonnegative A. Then I — XI 2 e I im- 
plies that S C {x : l(x) — Xl(x.) 2 = 0} which is the union of the hyperplancs 
{x : Z(x) = 0} and {x : l(x) = -y/l/A}. So take H to be the second hyperplane. 
Suppose now for each facet F of conv(S') there exists a hyperplane H 
parallel to F such that S C F U H . This implies that for any facet F and a 
fixed linear inequality If{*) > that is valid on S and holds at equality on 
F, lp attains the same nonzero value at all points in S \ F . By scaling we 
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can assume that value to be one, which implies that If(1 — If) is zero at all 
points in S, and hence, lp(l — lp) <G I. But then lp = l F + lp(l — lp) is 1-sos 
mod /. By Farkas Lemma, any valid linear inequality for a polytope can be 
written as a nonncgative linear combination of the facet inequalities of the 
polytope. Therefore, any linear polynomial I that is nonnegative over S is a 
positive sum of 1-sos polynomials mod /, and hence, I is 1-sos mod /. Now 
using Theorem 8, I is THi -exact. □ 

We call a polytope P a k- level polytope if for any facet F of P and any 
supporting hyperplane H of F, there are k — \ hyperplanes H±, ...,Hk-i all 
parallel to H such that the vertices of P are contains in H U Hi U . . . U H^-i- 
Theorem 10 states that I(S) is THi-exact if and only if S is the set of vertices 
of a 2-level polytope. Polytopes with integer vertices that are 2-level are called 
compressed polytopes in the literature [27, 29]. 





Fig. 4. The five 2-level polytopes in R 3 up to affine transformations. 



Examples of 2-lcvcl polytopes include simplices, cubes and cross-polytopes. 
In M 3 , up to affine transformations, there are only five different 2-level poly- 
topes and they are shown in Fig. 4. An example of a polytope that is not 
2-level is the truncated cube shown in Figure 5. Three parallel translates of 
the hyperplane spanned by the slanted face are needed to contain all vertices 
of the polytope, and hence this is a 3-level polytope. Combinatorial properties 
of 2-level polytopes can be found in [5] . 




Fig. 5. This polytope is 3-level but not 2-level. 
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The result in Theorem 10 can be generalized to a (weak) sufficient criterion 
for THfc-exactness. 

Theorem 11. If S C W 1 is the set of vertices of a (k + l)-level polytope then 
Z(S') is THj. -exact. 

Proof: As before, we may assume that conv(S') is full-dimensional, and as 
observed in the proof of Theorem 10, it is enough to prove that for every facet 
F of conv(iS'), if hp is a linear polynomial that is zero on F and non- negative 
on S then hF is fc-sos modulo I(S). We can also assume that F is contained in 
the hyperplane {x € K™ : X\ = 0}, and that all points in S have nonncgative 
first coordinate, as all the properties we are interested in are invariant under 
translations. 

Then, by scaling, we may assume hpfjx) = x\, and the (fc+l)Tevel property 
tells us that there exist k positive values a\ , . . . , au such that all points in S 
have the first coordinate in the set {0, a±, ■ . ■ ,<Xk}- Then we can construct 
a one variable Lagrange interpolator g of degree k such that g(0) = and 
g(di) = y/al for i = 1, . . . , k. This will imply that /if(x) = g{xi) 2 modulo 
T{S) and so hF is fc-sos modulo 1{S). D 

This sufficient condition is very restrictive in general, and in fact can be 
arbitrarily bad. 

Example 3. We saw in Corollary 2 that if G is a (2k + l)-cycle then the ideal 
Iq is TH 2 -exact. However notice that we need k + 1 parallel translates of the 
hyperplane k = ^ Xi to cover all the vertices of STAB(G), since the incidence 
vectors of the stable sets in G can have anywhere from to k entries equal to 
one. Theorem 11 would only guarantee that Iq is TH^-exact. 

When Vr(I) is not finite, THi-exactness becomes harder to guarantee. A 
useful result in this direction is an alternative characterization of the first 
thcta body of any ideal /. Given any ideal /, if we take all convex quadratic 
polynomials in /, and intersect the convex hulls of their zero sets, we obtain 
THi(Z) exactly. This result, proved in [5, Theorem 5.4], can be used in some 
simple cases to both prove or disprove THi -exactness. 

Many other questions on convergence remain open. For example, there is 
no known example of a smooth variety whose theta body hierarchy does not 
converge finitely, but there is also no reason to believe that such an example 
does not exist. In all studied examples of smooth hypersurfaces, convergence 
happens at the first non-trivial theta body. Either an explanation for this 
behavior, or a few examples of badly behaved smooth hypersurfaces, would 
be an important first step toward a better understanding of the theta body 
hierarchy for smooth varieties. Non-compact varieties have also not received 
much attention. 
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5 More Examples and Applications 
5.1 The maximum cut problem 

In section 3.1 we computed the theta body hierarchy for the maximum stable 
set problem. Another important problem in combinatorial optimization is the 
maximum cut (maxcut) problem. Given a graph G = ([n],E), we say that 
G C E is a cut of G, if there exists some partition of the vertices into two sets 
V and W such that C is the set of edges between a vertex in V and a vertex 
in W. The maxcut problem is the problem of finding a cut in G of maximum 
cardinality. Theta bodies of the maxcut problem were studied in [3], and in 
this subsection we will present some of those results. 

The model we use is similar to the one used in the stable set problem. 
The characteristic vector of a cut C of G, is the vector xc € { — 1> 1} E defined 
as xc( e ) = -1 if e e C and 1 otherwise. Let Co be the collection of all 
characteristic vectors of cuts of G, and X{Cq) the vanishing ideal of Cq- 
The maximum cardinality cut of G can be found, in principle, by optimizing 
X^ee-E x e over Cg- However, this is a difficult problem and the size of the the 
maxcut can be approximated by optimizing X^eeE^e over THi c (X(Cg))- 

It is not hard to show that I(Cq) is generated by the polynomials 1 — x\ 
for all e £ E, together with the polynomials 1 — riee.fi: x e where K C E is a 
chordless cycle. Using this one can construct a 6>-basis for X(Cg), but to do 
that we need to introduce another combinatorial concept. Given an even set 
T C [n], we call a subgraph H of G a T-join if the set of vertices of H with 
odd degree is precisely T. For example an 0-join is a cycle, and the minimal 
{s, £}-join is the shortest path from s to t. It is clear that there exists a T- 
join if and only if T has an even number of nodes in each of the connected 
components of G. Define Tg '■= {T Q [n] : 3 T — join in G}, and for each 
T G Tg, choose Ht to be a T-join with a minimal number of edges. Define 
T k := {T <ZT G ■■ \H T \ < k}, and note that 71 = {0} U E. Then one can show 
that B = {IIceT^e +X(Cg) ■ T <G Tg} is a 6>-basis for X(Cg), and we can 
therefore identify B with Tg and each Bk with 7fe. 

We can now give a description of the theta bodies of X(Cg)- 

{3 M y 0, M e Ml Tfc l x l r '=l such that 
M TlTl = M T3T4 if T X AT 2 = T 3 AT 4 
In particular, since 71 = {0} U E we get 

3 M > 0, M e RWuExmuE guch that 
M m = M ee = l,\/e€E 
THi(JCGf) = |y£K B :M fc = M e0 = y e V e e E 

M e f = y g if {e, /, g} is a triangle in G 
M e f = Mj/, if {e, /, 5, ft.} is a square in G ^ 
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These relaxations are very closely related to some older hierarchies of relax- 
ations for the maxcut problem. In particular they have a very interesting 
relation with the relaxation introduced in [11]. The theta body relaxation as 
it is, is not very strong, as shown by the following proposition. 

Proposition 2 ([3, Corollary 5.7]). Let G be the n-cycle. Then the smallest 
k for which I{Cg) is THk-exact is k = \n/A\ . 

Convergence can be sped up for graphs with large cycles by taking a 
chordal closure of the graph, computing the theta body for this new graph, 
and then projecting to the space of edges of the original graph. If instead of 
the chordal closure one takes the complete graph over the same vertices, we 
essentially recover the hierarchy introduced in [11]. 

One can use the usual theta body hierarchy together with Theorem 10 
to solve a question posed by Lovasz in [18, Problem 8.4]. Motivated by the 
fact that a graph is perfect if and only if the stable set ideal Iq is THi-exact, 
Lovasz asked to characterize the graphs that were "cut-perfect", i.e., graphs 
G for which X(Ca) is THi-exact. It turns out that such a "strong cut-perfect 
graph theorem" is not too hard to derive. 

Proposition 3 ([3, Corollary 4.12]). Given a graph G, the ideal I(Cg) is 
TEL -exact if and only if G had no K 5 minor and no chordless circuit of length 
greater than or equal to five. 

5.2 Permutation Groups 

In [15, Sect. 4], De Loera et al. study the behavior of the theta body hierarchy 
when applied to ideals associated to permutation groups. Let A C S n be a 
subgroup of permutations and identify each element in A with a permutation 
matrix. We can then see these permutation matrices as vectors in M nxrl and 
define I a to be their vanishing ideal. One of the main goals in [15, Sect. 4] is 
to provide sufficient conditions for the THi-exactness of Ia- 

A permutation group A is permutation summable if for all Pi , ... , P m <G A 
such that all entries of J2 Pi ~ ^ are nonnegative, ^2 Pi — I is itself a sum 
of matrices in A. For example, S n itself is permutation summable as a direct 
consequence of Birkhoff 's Theorem. We say that A is strongly fixed-point free 
if the only permutation in A that has fixed points is the identity. In terms of 
matrices this is equivalent to saying that an element of A is either the identity 
or has a zero diagonal. This trivially implies that all strongly fixed-point free 
groups are permutation summable. 

Proposition 4 ([15, Theorem 4.5]). If A C S n is a permutation summable 
group, then I a is THi-exact. 

A very interesting class of permutation groups is the automorphism groups 
of graphs. Given a graph G = ([n], E), Aut(G) C S n is defined to be the group 
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of vertex permutations a such that {i,j} € E implies {o~(i),o~(j)} £ E. The 
automorphism polytope of G, denoted as P\ n +( G \ > is the convex hull of Aut(G), 
where again we are identifying a permutation with its matrix representation. 
There has been some study of ways to relax this polytope. Sec [30] for instance. 
One such relaxation is the polytope Pq of all the points P £ M. nxn such that 

n 

PA G = A G P- Y, p iJ = !. 1 < 3 < n; 



Y p i,j = !. !<«<"; ^j > 0, 1 < «,j < n; 

where Aq is the n x n adjacency matrix of G. If in the constraints of Pq 
we replace P±j > by Pjj G {0, 1}, then we get precisely Aut(G). In fact, 
Pj^ut(G) 1S ^ e i n tege r hull of Pq i.e., the convex hull of the integer points of 
Pg- If Pg = PAvlUg) then G is said to be compact. It is not hard to show 
that THi(/Aut(G)) — ^ G ' so ^e first theta body provides an approximation 
of PauUg) that is at least as good as the linear approximation Pq, but one 
can actually say more. 

Proposition 5. [15, Theorem 4-4] The class of compact graphs is strictly 
included in the class of graphs with THi -exact automorphism ideal. In par- 
ticular, let Gi,...,G m be k-regular graphs (all vertices have degree k) that 
are compact and G their disjoint union. Then lAut(G) * s THi -exact, but G is 
compact if and only if G\ = • • • = G m . 

Finally note that while theta bodies of automorphism groups of graphs 
have interesting theoretical properties, computing with them tends to be quite 
hard, as there is no easy general way to obtain a good #-basis for this prob- 
lem. In fact, knowing if a graph has a non-trivial automorphism, the graph 
automorphism problem, is equivalent to knowing if a #-basis for lAat(G) ^ as 
more than one element. Determining the complexity class of this problem is 
a major open question. 
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